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ABSTRACT 

We use the clustering results obtained by Madgwick et al. (2003) for a sample of 96,791 
2dF galaxies with redshift 0.01 < z < 0.15 to study the distribution of late-type and 
early-type galaxies within dark matter haloes of different mass. Within the framework 
of our models, galaxies of both classes are found to be as spatially concentrated as the 
dark matter within haloes even though, while the distribution of star-forming galaxies 
can also allow for some steeper profiles, this is drastically ruled out in the case of 
early-type galaxies. We also find evidence for morphological segregation, as late-type 
galaxies appear to be distributed within haloes of mass scales corresponding to groups 
and clusters up to about two virial radii, while passive objects show a preference 
to reside closer to the halo centre. If we assume a broken power-law of the form 
(7Vg a i)(m) = (to/too)" 1 for m cu t < to < too and (N ga x)(m) — (to/too)" 2 at higher 
masses to describe the dependence of the average number of galaxies within haloes on 
the halo mass, fits to the data show that star-forming galaxies start appearing in haloes 
of masses m C ut — 10 toq, much smaller than what is obtained for early- type galaxies 
(?7i cu t — 10 12 ' 6 TOq). In the high-mass regime to > mo, (N sa \) increases with halo 
mass more slowly [a^ ~ 0.7) in the case of late-type galaxies than for passive objects 
which present ct2 — 1.1. The above results imply that late- type galaxies dominate the 
2dF counts at all mass scales. We stress that - at variance with previous statements 
- there is no degeneracy in the determination of the best functional forms for p(r) 
and (-/Vg a i), as they affect the behaviour of the galaxy-galaxy correlation function on 
different scales. 

Key words: galaxies: clustering - galaxies: optical- cosmology: theory - large-scale 
structure - cosmology: observations 



1 INTRODUCTION 

Measurements of the galaxy-galaxy correlation function con- 
tain a wealth of information both on the underlying cosmo- 
logical model and on the physical processes connected with 
the formation and evolution of galaxies. Untangling these 
two effects from the observed clustering signal of a partic- 
ular class of sources is, however, not an easy task. For in- 
stance, the physics of galaxy formation affects the relation- 
ship between the distribution of luminous and dark matter 
(the so-called "bias") so that different types of galaxies are 
expected to exhibit different clustering properties. This has 
indeed been observed (see e.g. Loveday et al., 1995; Guzzo 
et al., 1997; Loveday, Tresse & Maddox, 1999; Magliocchetti 
et al., 2000 just to mention few) in the past decade, when 
large- area surveys started including enough sources to allow 
for precision clustering statistics. 

From a theoretical point of view, the relationship be- 
tween dark matter and galaxy distribution has not yet fully 



been understood since, while the dynamics of dark matter is 
only driven by gravity and fully determined by the choice of 
an appropriate cosmological model (see e.g. Jenkins et al., 
1998), the situation gets increasingly more difficult as one 
tries to model the physical processes playing a role in the 
process of galaxy formation. 

As a first approximation, galaxies can be associated with the 
dark matter haloes in which they reside (in a one-to-one re- 
lationship) , so that their clustering properties can be derived 
within the framework of the halo-model developed by Mo & 
White (1996). Such models have been proved extremely use- 
ful to describe the clustering of high-redshift sources such as 
quasars, Lyman Break and SCUBA galaxies (see e.g. Matar- 
rese et al., 1997; Moscardini et al., 1998; Martini & Wein- 
berg, 2001; Magliocchetti et al., 2001; Porciani & Giavalisco, 
2002) where the assumption of one such object per halo can 
be considered a reasonable guess. 

The validity of this approach however breaks down as one 
moves to objects with higher number densities such as low- 
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redshift galaxies. The distribution of these kind of sources 
within dark matter haloes is in general an unknown quan- 
tity which will depend on the efficiency of galaxy formation 
via some complicated physics connected to processes such as 
gas cooling and/or supernova feedback (see e.g. Somerville 
et al., 2001; Benson ct al., 2001). 

The analytical connection between the distribution of 
sources within dark matter haloes and their clustering prop- 
erties has been studied in detail by a number of recent pa- 
pers (Peacock & Smith, 2000; Seljak, 2000; Scoccimarro et 
al., 2001; Bullock, Wechsler & Somerville 2002; Marinoni 
& Hudson, 2002; Berlind & Weinberg, 2001; Moustakas & 
Somerville, 2002; Yang, Mo & van den Bosch 2003; van den 
Bosch, Yang & Mo, 2003; see also Cooray & Sheth, 2002 
for an extensive review on the topic) which mainly focus on 
the issue of the halo occupation function i.e. the probability 
distribution of the number of galaxies brighter than some lu- 
minosity threshold hosted by a virialized halo of given mass. 
Within this framework, the distribution of galaxies within 
haloes is shown to determine galaxy-galaxy clustering on 
small scales, being responsible for the observed power-law 
behaviour at separations r < 3 Mpc. 

A number of parameters are necessary to describe the 
halo occupation distribution (i.e. the probability of finding 
iVg a i galaxies in a halo of mass m) of a class of galaxies. 
These parameters - expected to vary with galaxy type - 
cannot be worked out from first principles and have to rely 
for their determination either on comparisons with results 
from semi-analytical models or on statistical measurements 
coming from large data-sets, with an obvious preference for 
this second approach. 

The two last-generation 2dF and SDSS Galaxy Redshift Sur- 
veys (Colless et al., 2001; York et al., 2000) come in our 
help since - with their unprecedented precision in measur- 
ing galaxy clustering - they can in principle constrain the 
functional form of the halo occupation distribution (see also 
Zehavi et al., 2003). 

Lower-order clustering measurements such as the spa- 
tial two-point correlation function are already available for 
samples of 2dF sources (Norberg et al., 2001; Hawkins et al., 
2003; Madgwick et al., 2003). In this work we will use the re- 
sults of Madgwick et al. (2003) on the clustering properties 
of late-type and early-type galaxies to investigate possible 
differences in the processes responsible for the birth and evo- 
lution of these two classes of sources. 

The layout of the paper is as follows: Section 2 intro- 
duces the formalism necessary to describe galaxy cluster- 
ing within the halo occupation distribution context. Key 
ingredients for this kind of analysis are the average num- 
ber {N ga \)(m) of galaxies hosted in dark matter haloes of 
mass m, a measure of the spread (N gal (N gal — l))(m) about 
this mean value and the spatial distribution p(r) of galaxies 
within their haloes. In Section 3 we briefly describe the 2dF 
Galaxy Redshift Survey with particular attention devoted 
to measurements of the luminosity function and correlation 
function for early-type (i.e. passively evolving) and late-type 
(which are still in the process of active star formation) galax- 
ies, and derive estimates for the number density of these 
sources. Section 4 presents and discusses our results on the 
distribution of different types of galaxies within dark matter 
haloes as obtained by comparing predictions on their num- 



ber density and correlation function with 2dF observations, 
while Section 5 summarizes our conclusions. 

Unless differently stated, throughout this work we will 
assume that the density parameter fio = 0.3, the vacuum 
energy density A = 0.7, the present-day value of the Hubble 
parameter in units of 100 km/s/Mpc ho = 0.65 and cr 8 = 0.8 
(with as the rms linear density fluctuation within a sphere 
with a radius of 8/i _1 Mpc), as the latest results from the 
joint analysis of CMB and 2dF data seem to indicate (see e.g. 
Lahav et al., 2002; Spergel et al., 2003). Note, however, that 
the normalization of the linear power spectrum of density 
fluctuations is still very controversial: estimates of as from 
either weak-lensing or cluster abundances range between 0.6 
and 1.0, while some analyses of galaxy clustering seem to 
favour values ~ 0.7 (van den Bosch, Mo & Yang 2003). Our 
results will slightly depend on the assumed value for ug. 



2 GALAXY CLUSTERING: THE THEORY 

The purpose of this Section is to introduce the formalism 
and specify the ingredients necessary to describe galaxy 
clustering at the 2-point level. Our approach follows the 
one adopted by Scoccimarro et al. (2001) which is in turn 
based on the analysis performed by Scherrer & Bertschinger 
(1991). In this framework, the galaxy-galaxy correlation 
function can be written as 

£ s (x - x') = £"(x - x') + £f (x - x'), (1) 
with 

£ L g H = ^2 / n(m)(N gaX (N sal - \)){m)dm x 

j Pm(y)pm(y + x - x')d 3 y, (2) 

and 

= 4 / n(mi)(AT ga i)(mi)dmi x (3) 

n 3 J 

J n(m2){N ga _i)(m 2 )dm2 J p mi (x - xi)d 3 :n 

x J Pm 2 ( x ' - x a)C( x i - x 2; mi, m 2 )d 3 X2, 

where the first term ^ h accounts for pairs of galaxies re- 
siding within the same halo, while the ^ h represents the 
contribution coming from galaxies in different haloes. Note 
that all the above quantities are dependent on the redshift 
z, even though we have not made it explicit. 
In the above equations, {N ga ,\)(m) is the mean number of 
galaxies per halo of mass m, and (N ga ,\(N ga ,i — l))(m) - also 
dependent on the mass of the halo hosting the galaxies - is 
a measure of the spread about this mean value. The mean 
comoving number density of galaxies is defined as: 

n 3 = J n(m)(N gal )(m)dm, (4) 

where n{m) is the halo mass function which gives the num- 
ber density of dark matter haloes per unit mass and volume. 
£(xi — X2; mi , 7712) is the two-point cross-correlation function 
between haloes of mass mi and n»2 and, finally, p m (y) is the 



© 0000 RAS, MNRAS 000, 000-000 



The Halo Distribution of 2dF Galaxies 3 



(spatial) density distribution of galaxies within the haloes, 
normalized so to obtain 



rr 
JO 



p m (y)d 3 y = 1, 



(5) 



where r cut is the radius which identifies the outer boundaries 
of the halo. 

From the above discussion it then follows that, in order to 
work out £ s , we need to specify the halo-halo correlation 
function, the halo mass function, the spatial distribution 
of galaxies within the haloes and a functional form for the 
number distribution of galaxies within the haloes. This is 
done as follows. 



2.1 Halo-Halo Correlation Function 

An approximate model for the 2-point correlation function 
of dark-matter haloes can be easily obtained from the mass 
autocorrelation function as (see e.g. Porciani & Giavalisco, 
2002) 

£(r,z,mi,m2) = 

{£dm{r, z)bi(mi,z)b2(m2, z) if r > n + r 2 
— 1 otherwise, 



(6) 



where the above expression takes into account the halo-halo 
spatial exclusion (n = r cutl and r 2 = r cut2 are the Eule- 
rian radii of the collapsed haloes, in general identified with 
their virial radii), and the mass-mass correlation function 
(,dm(r, z) - fully specified for a given cosmological model and 
a chosen normalization of as - has been calculated follow- 
ing the approach of Peacock &: Dodds (1996) which is suf- 
ficiently accurate both in the linear and non-linear regimes. 
In fact, results obtained with the more precise algorithm de- 
veloped by Smith et al. (2003) significantly differ from those 
obtained with the method by Peacock & Dodds (1996) only 
in the regime where the 1-halo term dominates the 2-point 
clustering signal. 

The linear bias factor b(m, z) of individual haloes of 
mass m at redshift z can instead be written as (Sheth & 
Tormen, 1999; see also Cole & Kaiser 1989; Mo & White 
1996; Catelan et al. 1997; Porciani et al. 1998) 



6(m, z) 



1 + 



2p/S c 



1 + (av)P 



(7) 



with p = 0.3, a = 0.707, v = (S c /af, where 5 C ~ 1.686 and 
a respectively are the critical overdensity for collapse and 
the linear rms variance of the power spectrum on the mass 
scale m at redshift z. 

Note that, at variance with previous works (e.g. Pea- 
cock & Smith, 2000; Seljak, 2000; Scoccimarro et al.,2001; 
Berlind & Weinberg, 2001) where the halo-halo correlation 
function was only derived in the linear regime, our equa- 
tion (6) fully accounts for the non-linear evolution of den- 
sity fluctuations. Using linear theory to compute the 2-halo 
term can be considered as a good approximation on large 
scales (r > 5 Mpc) - where the clustering growth is indeed 
still linear - and does not create any problems on small 
scales (r < 1 Mpc) - where the 1-halo term dominates 
the clustering signal. It, however, breaks down at interme- 
diate distances where the £l h and Q h contributions are of 



comparable importance. It follows that the use of a linear 
halo-halo correlation function in equation (3) systematically 
leads to a serious underestimate of the clustering signal (1) 
produced by low- z galaxies on scales 1 < r/[Mpc] < 5, when 
compared with results obtained by taking into account the 
fully non- linear behaviour of ^ m (r, z). For this reason, we 
believe our approach to be more consistent than the ones 
adopted so far. We note that similar models - unknown to 
us till the very last stages of the present paper - have also 
been used by Zehavi et al. (2003), Yang et al. (2003) and 
van den Bosch et al. (2003). 



2.2 Mass Function 

For the analytical expression of the halo mass function we 
rely once again on the Sheth & Tormen (1999) form: 



. . An lav 
n(m,z) = —* — 
m z V 2-7T 



exp 



(-?) 



d(lnu) 



d(ln 



(8) 



(with A = 0.322, p the mean background density and the 
other quantities defined as above) , since it gives an accurate 
fit to the results of N-body simulations with the same initial 
conditions (Jenkins et al. 2001; Sheth, Mo & Tormen 2001). 



2.3 Spatial Distribution of Galaxies 

The first, easiest approach one can take for the spatial dis- 
tribution of galaxies within a halo of specified mass m is to 
assume that galaxies follow the dark matter profile. Under 
this hypothesis we can then write 



Pm{r) = p m (r) • m = p 



fc 3 /S 



cr/r vil (l + cr/r viT ) 2 ' 



(9) 



where we use the Navarro, Frenk & White (1997, here- 
after NFW) expression, which provides a good description 
of the density distribution within virialized haloes in nu- 
merical simulations. In equation (9), r v ; r is the virial ra- 
dius of the halo, related to its mass via m = (47rr5 ir /3) Ap, 
where A (=340 for an Q = 0.3 universe at z = 0) is the 
characteristic density contrast of virialized systems; / = 
A/[ln(l + c) — c/(l + c)], and for the concentration param- 
eter c we use equations (9) and (13) in Bullock et al. (2001). 

Clearly, the assumption for the distribution of galaxies 
within virialized haloes to trace the dark matter profile is not 
necessarily true. For instance, the semi-analytic models by 
Diaferio et al. (1999) suggest that this cannot hold for both 
late-type and early-type galaxies since blue galaxies tend to 
reside in the outer regions of their parent haloes, while red 
galaxies are preferentially found near the halo centre. 
For this reason, in the following analysis we will also con- 
sider spatial distributions of the form p' m (r) oc (r/r v i r )~ /3 , 
with (3 = 2, 2.5, 3, where the first value corresponds to the 
singular isothermal sphere case. 

The last remark concerns the choice for values of r cut 
in equations (5) and (6). All the profiles (both the NFW 
and the power-laws) considered so far formally extend to in- 
finity, leading to divergent values for the associated masses. 
This implies the need to "artificially" truncate the distri- 
bution profiles at some radius r cut . One sensible choice is 
to set r cut = r v i r , since one expects galaxies to form within 
virialized regions, where the overdensity is greater than a 
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certain threshold. However, this might not be the only pos- 
sible choice since for instance - as a consequence of halo-halo 
merging - galaxies might also be found in the outer regions of 
the newly-formed halo, at a distance from the center greater 
than r v ir- 

The way different assumptions for the steepness of 
the profiles and different choices for r cu t affect the galaxy- 
galaxy correlation function on small scales (r < 1 Mpc) is 
presented in Figure (1). To this particular aim, both 
and £,g h have been derived from equation (1) by setting 
(Ngai) = {-/Vg a i(iVg a i — 1)) = 1 for all halo masses greater 
than 10 10 ' 7 ?tiq and otherwise. 

The two panels show the case for r cut = r v i r (top) and 
r cut = 2 -rvir (bottom), while solid, short-dashed, long-short- 
dashed and dotted lines respectively represent the results for 
a NFW, a power-law with j3 = 2.5, a power-law with j3 = 2 
and a power-law with /3 = 3 distribution profiles. Lower 
curves (for r — > 0) correspond to the £g term (contribution 
from objects in different haloes), while the upper ones indi- 
cate the £g h term. 

As Figure (1) clearly shows, both the scale at which the tran- 
sition from a regime where objects in different haloes domi- 
nate the clustering signal to a regime where galaxies within 
the same halo start giving a contribution and the amplitudes 
of the £l h and t^ h terms greatly depend on the radius cho- 
sen to truncate the distribution profiles. More in detail, the 
amplitude of both terms decreases and differences between 
predictions obtained for different profiles become more pro- 
nounced as the value for r cut increases. Finally note that - 
independently of the value of r cut - a stronger clustering sig- 
nal at small scales is in general expected for steeper density 
profiles. 

Results originating from different assumptions for a spa- 
tial distribution of galaxies within dark matter haloes and 
different truncation radii will be further investigated in the 
following sections. 




j i l i i i 



u ' i r[Mpc] 1 

Figure 1. Results for the galaxy-galaxy correlation function as 
obtained from different assumptions for the galaxy distribution 
profiles p(v) and different choices for r C ut- The CaSG for Tcut — ^*vir 
is shown in the top panel, while the one for 

^*cut — 2 ' ^*vir ^ 

presented in the bottom panel. Solid, short-dashed, long-short- 
dashed and dotted lines represent the predictions for a NFW, a 
power-law with = 2.5, a power-law with (3 = 2 and power-law 
with /3 = 3 distribution runs. Lower curves correspond to the 
term in equation (1) (contribution from objects in different 
haloes), while the upper ones indicate the £g h term. All the above 
curves have been calculated by setting in equation (1) (iV ga i) = 
(-^gal C^gal — 1)) = 1 f° r a ^ halo masses greater than 10 10 ' 7 mQ 
and otherwise. 



2.4 Halo Occupation Function 

A key ingredient in the study of the clustering properties of 
galaxies is their halo occupation function p(N sa .\\m) which 
gives the probability for a halo of specified mass m to contain 
iVg a i galaxies. In the most general case, p(iVg a i[ro) is entirely 
specified by the knowledge of its n moments {N gal ) (m) which 
in principle can be observationally determined by means 
of the so-called "counts in cells" analysis (see e.g. Benson, 
2001). Unfortunately this is not feasible in reality, as mea- 
sures of the higher moments of the galaxy distribution get 
extremely noisy for n > 4 even for 2-dimensional catalogues 
(see e.g. Gaztanaga, 1995 for an analysis of the APM sur- 
vey). 

A possible way to overcome this problem is to rely on the 
lower-order moments of the galaxy distribution to determine 
the low-order moments of the halo occupation function, and 
then assume a functional form for p(N ga i\m) in order to 
work out all the higher moments (see e.g. Scoccimarro et 
al., 2001; Berlind & Weinberg, 2002). Clearly, better and 
better determinations of p(N ga i\m) are obtained as we man- 
age to estimate higher and higher moments of the galaxy 
distribution function. 

Since this work relies on measurements of the two-point 
correlation function of 2dF galaxies, equation (1) shows that 



in our case only the first and second moment of the halo 
occupation function can be determined from the data, as 
these are the only two quantities which play a role in the 
theoretical description of £ B . 

Following Scoccimarro et al. (2001), we chose to write the 
mean number of galaxies per halo of specified mass m as 

{A r g a i)(m) = if m < m cut 

(iV ga i)(m) = (m/ino)" 1 if »rtcut < m < m (10) 

(N sa ,i)(m) = (m/mo)° ! if m > m , , 

where m cut , mo, ai and ct2 are parameters to be deter- 
mined by comparison with observations. The choice of the 
above functional form for (N ga \)(m) relies on the physics 
connected with galaxy formation processes (see e.g. Benson 
et al., 2001; Somerville et al., 2001; Sheth & Diaferio, 2001). 
For instance, m cu t gives the minimum mass of a halo able to 
host a galaxy since - for potential wells which are not deep 
enough - galaxy formation is inhibited by supernova pro- 
cesses occurring amongst the first stars which can blow the 
remaining gas away from the halo itself therefore suppressing 
further star-formation. On the other hand, given that the in- 
ternal velocity dispersion of haloes increases with halo mass, 
gas is expected to cool less efficiently and therefore inhibit 
at some level galaxy formation in more massive haloes. In 
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a hierarchical scenario for structure formation, however, the 
most massive objects are formed by merging and accretion 
of smaller units. Thus, one expects the number of galaxies 
to increase with the halo size in the high-mass regime. All 
this can then be parameterized by a broken power-law of 
the form (10), with mo the "threshold mass" at which the 
transition between the two different scaling laws occurs. 

The last ingredient needed for the description of 2-point 
galaxy clustering is the second moment of the halo occupa- 
tion function appearing in equation (1). This term quantifies 
the spread (or variance) about the mean value of the number 
counts of galaxies in a halo. A convenient parameterization 
for this quantity is: 



<AW(JV ga i - l)>(m) = a(m) 2 (iV gal > 2 , 



(11) 



where a(m) = 0, log(m/m cu t)/log(mo/m cut ), 1, respectively 
for m < m cu t, rn cut < m < mo and m > mo. Note that, 
while the high-mass value for a(m) simply reflects a Poisso- 
nian statistics, the functional form at intermediate masses 
(chosen to fit the results from semi-analytical models - see 
e.g. Sheth & Diaferio, 2001; Berlind & Weinberg 2002 - and 
smoothed particle hydrodynamics simulations - Berlind et 
al. 2003) describes the sub-Poissonian regime. 



As a last consideration, note that equation (1) presents in 
both the £g h and terms convolutions of density profiles. 
Since this is somehow difficult to deal with, we prefer to work 
in Fourier space where all the expressions simply become 
multiplications over the Fourier transforms of the profiles. 
Equation (1) is therefore equivalent to: 

n 2 g A g (k) = J n(m)(iV gal (A gal - l))(m)\u m (k)\ 2 dm + 



I 
I 



u mi (k)(N ga i)(mi)n(mi)dm 1 x 



u m2 (fc)n(m 2 )(A r ga i)(m 2 ) A(fc, mi, m 2 )dm 2 , (12) 



where - allowing for the exclusion effects as in equation (6) 

- A(fc,mi,m 2 ) = Adm(fc)&(mi)b(m 2 ) is the power spectrum 
of haloes of mass mi and m 2 (A dm (k) = k /(2n )Pdm(k) is 
the normalized non-linear power-spectrum for dark matter 

- (see e.g. Peacock & Dodds, 1996), 



u m (k) 



_k 

(2' 



>l_ r 

« 2 )Jo 



p m (r)sin(fcr) / (kr)4nr 2 dr, 



(13) 



with pm{r) defined as in (5), is the Fourier transform of 
the galaxy distribution profile truncated at r cut , and where 
all the quantities are implicitly taken at a fixed z. In this 
framework, the galaxy-galaxy correlation function can then 
be obtained from equation (12) via 



J 



A B (k) 



sin(fcr) dk 
kr k 



(14) 



3 THE 2DF GALAXY REDSHIFT SURVEY 

The 2dF Galaxy Redshift Survey (2dFGRS: Colless et al., 
2001) is a large-scale survey aimed at obtaining spectra for 
250,000 galaxies to an extinction-corrected limit for com- 
pleteness of bj — 19.45 over an area of 2151 square de- 



grees. The survey geometry consists of two broad declination 
strips, a larger one in the SGP covering the area 3 h 30 m < 
RA(2000) < 2l' i 40 m , -37.5° < dec(2000) < -22.5° and 
a smaller one set in the NGP with 9 h 50 m < RA(2000) < 
14 h 50 m , 2.5° < dec(2000) < -7.5°, plus 100 random 2- 
degree fields spread uniformly over the 7000 square degrees 
of the APM catalogue in the southern Galactic hemisphere. 

The input catalogue for the survey is a revised ver- 
sion of the APM galaxy catalogue (Maddox et al. 1990a, 
1990b, 1996) which includes over 5 million galaxies down to 
bj = 20.5. Redshifts for all the sources brighter than bj = 
19.45 are determined in two independent ways, via both 
cross-correlation of the spectra with specified absorption- 
line templates (Colless et al., 2001) and by emission-line 
fitting. These automatic redshift estimates have then been 
confirmed by visual inspection of each spectrum, and the 
more reliable of the two results chosen as the final redshift. 
A quality flag was assigned to each redshift: Q = 3, Q — 4 
and Q = 5 correspond to reliable redshift determination, 
Q = 2 means a probable redshift and Q = 1 indicates no 
redshift measurement. The success rate in redshift acquisi- 
tion for the surveyed galaxies (determined by the inclusion 
in the 2dF sample of only those objects with quality flags 
Q = 3 to Q — 5) is estimated about 95 per cent (Folkes et 
al., 1999). The median redshift of the galaxies is 0.11 and 
the great majority of them have z < 0.3. 



3.1 Luminosity Functions and Number Densities 

Madgwick et al. (2002) calculate the optical bj luminosity 
function (LF) for different subsets of M — 51ogio(ho) < — 13 
2dFGRS galaxies defined by their spectral type. The spec- 
tral classification - based upon a Principal Component Anal- 
ysis - was performed for 75,589 galaxies found at redshifts 
0.01 < z < 0.15 and allowed to divide the whole population 
into four well-defined classes according to the strength of 
their star-formation activity: from type 1 (early-type galax- 
ies only showing absorption lines in their spectra) to type 4 
(extremely active star-forming galaxies). 

Fitting functions for the different luminosity distribu- 
tions are presented in Madgwick et al. (2002) and can be 
used to determine the average number density of galaxies of 
different spectral types via (see e.g. Lin et al., 1996): 



= N { x 



Si(z)(dV/dz)dz 



with the selection function Si(z) defined as 



Si(z) = 



m a x[M* 



X (z)-M„ 
(z)-M m 



$ t (M)dM 



(15) 



(16) 



where dV/dz is the volume element, i — 1,2,3,4 refers 
to the spectral class, Ni is the total number of galax- 
ies belonging to a specific type found in the survey and 
$i is their luminosity function. M m i n = — 13 + 51ogio(ho) , 
Zmin = 0.01 and M max ~ -21 + 51ogi (h ), z max = 0.15 re- 
spectively are the minimum and maximum absolute mag- 
nitudes and redshifts of the objects under exam, while 



M 



min(max) 



^m a x(min) 



25 — 51og(d^) — ki(z), where 



ki(z) is the K-correction, d % L the luminosity distance and 
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&min = 14, 6max = 19.45 are the apparent magnitude limits 
of the 2dF survey. 

We have then applied equation (15) to the four different 
galaxy types and found: 

q f -4.4 ■ 1CT 3 
" 10 { +7.0-10- 

n 92 = 0.0103 ±3 -10~ 4 

{_ a in- 4 
+9 ■ 10- 4 (17) 

n 94 =0.018 ±1.8- 10~ 3 , 

where the quoted la errors, in all but the type 1 case, 
are obtained by varying the 'break' luminosities M* and 
the faint-end slopes fii of the Schechter functions "^(M) = 
(0.4 lnl0)*r(10-°- 4(M - M '* ) ) 1+w exp(-10-°- 4 ( M - M *)) (note 
that our calculations are independent of the normalizations 
$|) along their joint la error ellipse. 

Two features have to be noticed in (17). First, errors on 
n gi - especially the upper la limit of the number density 
- are significantly larger than those derived for the other 
three classes of sources. This is due to the fact that the 
Schechter function might not provide a good fit to the faint 
end of the LF of type 1 galaxies (Madgwick et al., 2002); 
an extra term of the form $(M) = lQ a+bM is needed for 
M — 51ogio(ho) > —16, where the parameters a and b can 
only be determined from the data with quite large uncer- 
tainties. Since it is the faint end of the LF which mostly 
contributes to the determination of n gi , and this is the re- 
gion where the errors on a and b dominate over those derived 
for the various parameters in the Schechter function, this ex- 
plains our finding for such large uncertainties associated to 
the measurement of n 91 . 

The second point to be noticed concerns the errors asso- 
ciated to n S4 . In fact it turns out that, also in the case 
of type 4 galaxies, the Schechter function does not pro- 
vide a good fit to the faint end of the luminosity func- 
tion as it systematically overestimates the number density 
of M — logio(ho) > —16 galaxies. This implies that the total 
number of type 4 sources as derived from integration of the 
LF is in agreement with the observed one only if we subtract 
to the best estimate n 9i in eq. (17) an error corresponding 
to a 3a confidence level in \x±. Since, as we will see better 
in the next Section, the observed number density of galax- 
ies plays a relevant role in the determination of the best 
halo occupation model, we have then decided in the case of 
type 4 sources to consider errors on fi4 at the 3a level; this 
propagates to a lower limit for n 94 = 0.0127. 

Finally, from the luminosity function we can also deter- 
mine another quantity which will be useful in the following 
Sections: the effective redshift of a class of sources defined as 
zf — J z Zmax zSi(z)(dV/dz)dz. Numbers obtained for the dif- 
ferent types of galaxies under exam then read: zf = 0.098, 
zf = 0.091, zf = 0.082, zf = 0.078, indicating a pref- 
erence for late-type galaxies to be found at lower redshifts 
than early-type objects. 

3.2 Correlation Functions 

A sample with the same selection criteria as the one intro- 
duced in Section 3.1 but containing more (96,791) objects 



has been used by Madgwick et al. (2003) to calculate the 
clustering properties of galaxies belonging to different spec- 
tral types. In order to increase the statistics associated to the 
measurements, galaxies have been grouped into two broad 
categories: early-types - 36318 objects with effective redshift 
0.1 to be identified with those sources belonging to spectral 
class 1 - and late-type - 60473 objects with effective red- 
shift 0.09 obtained by taking into account all galaxies from 
spectral classes 2 3 and 4 - ones. 

The different observed correlation functions are shown 
in Figures (2) and (6) for early-type and late-type galaxies, 
respectively. In order to get rid of redshift distortions, the 
correlation function in real space has been inferred by com- 
puting the bi-dimensional correlation parallel and transverse 
to the line-of-sight, £g(rp,rr), and by integrating it in the 
rp direction. The quantity presented in Figures (2) and (6) 
therefore corresponds to 

t 9{rT) = 2 £ Ur \r*-%y/> - (18) 

Error-bars have been obtained by bootstrap resampling, 
adapting the method presented by Porciani & Giavalisco 
(2002). 

Measurements for the integrated correlation function 
(18) differ for the two populations not only in their ampli- 
tude, but also in the slopes (Madgwick et al., 2003 - by 
fitting the data with a power-law £(r) = (r/ro) 7 - find 
r = 3.67 ± 0.30 h" 1 Mpc, 7 = 1.60 ± 0.04 for late-type 
galaxies and r = 6.10 ± 0.34 h~ 1 Mpc, 7 = 1.95 ± 0.03 for 
early- type sources). As we will extensively see in the next 
Sections, these differences can provide a great amount of in- 
formation on the processes associated to galaxy formation 
in the two different cases of passive and active star-forming 
galaxies. 



4 RESULTS 

In order to determine the best values for the parameters 
describing the halo occupation number (10) we allowed them 
to vary within the following region: 

-1 < ai < 2; 
-1 < a 2 < 2; 
10 9 m < m cut < 10 13 m Q ; 
m cut < mo < m cut ■ 10 3 . 

Combinations of these four quantities have then been used 
to evaluate the mean number density of galaxies n g via equa- 
tion (4). Only values for n g within 2a from the observed ones 
(quoted in Section 3.1) were accepted and the correspond- 
ing values for ot\ , 012, m cut and mo have subsequently been 
plugged into equation (1) to produce - for a specified choice 
of the distribution profile and effective redshift z e (slightly 
different in the case of early-type and late-type galaxies) - 
the predicted galaxy-galaxy correlation function to be inte- 
grated via eq. (18) and compared with the data by means 
of a least squares (x 2 ) fit. 

The value for the truncation radius of the halo was set 
to r v i r and the above procedure repeated for different choices 
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Figure 2. Projected correlation function of early- type galaxies. 
Data-points represent the results from Madgwick et al. (2003), 
while the solid curve is the best fit to the measurements obtained 
for a halo number density of the form (10), with ai = —0.2, 



a 2 = 1.1, m cut 



10 1 
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distributed within their dark matter haloes according to a NFW 
profile. Dashed and dotted lines respectively indicate the contri- 
bution £g from galaxies residing in different haloes and the 
term originating from galaxies within the same halo. 
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Figure 3. Projected correlation function of early-type galaxies. 
Data-points represent the results from Madgwick et al. (2003), 
while solid, dashed, dotted and dashed-dottcd curves illustrate 
the best-fit models respectively obtained for a NFW, a power-law 
with p = 2.5, a power-law with (3 = 2 and a power-law with /3 = 3 
galaxy distribution profiles (see text for details). The arrow shows 
the lower limit to the measured correlation function set on small 
scales by fibre collisions in the 2dF survey (Ed Hawkins, private 
communication) . 



of p m (r). The following sub-sections describe the results ob- 
tained for the two different classes of Early- and Late-type 
galaxies. 

4.1 Early- type galaxies 

The best description of the data in this case is provided 
by a model with ai ~ —0.2, Q2 = 1.1, m cut = 10 12 ' 6 m©, 
mo = lO 13 ' 5 rri0 (in good agreement with the findings of Ze- 
havi et al., 2003) and with a mild preference for galaxies 
to be distributed within their dark matter haloes according 
to a NFW profile. The projected galaxy-galaxy correlation 
function for this combination of values and the NFW spatial 
distribution is illustrated by the solid curve in Figure (2), 
while the dashed and dotted lines respectively indicate the 
contribution from galaxies residing in different haloes 
and the £g term originating from galaxies within the same 
halo. The agreement between data and predictions is good 
at all scales even though the model tends to underestimates 
the correlation function at intermediate scales (between 3 
and 10 Mpc), where the maximum discrepancy is ~ 20 per 
cent. 

A more quantitative assessment of the goodness of the 
match can be found in Table 1 which - for each choice of 
the distribution profile p m (r) and for the different classes 
of early-type and late-type galaxies - provides the value of 
the x 2 = Xrnin obtained for the best fit to the data and 
the corresponding estimates for the parameters which ap- 
pear in the description of the halo occupation number (10). 
la errors on these quantities are obtained by requiring their 



different combinations not to produce models for the galaxy- 
galaxy correlation function which - when compared to the 
measurements - correspond to x 2 values which differ from 
the minimum by a factor greater than Ax 2 = 3.53, where 
this last figure has been derived for an analysis with three 
degrees of freedom (assuming Gaussian errors). Three is in 
fact the number of degrees obtained if one subtracts to the 
number of independent £ 9 measurements (eight - Darren 
Madgwick, private communication) * the number of param- 
eters to determine (four) and the (one) constraint on n g . 

A closer look to Table 1 shows that, in the case of early- 
type galaxies, the best-fit values for the parameters appear- 
ing in equation (10) are independent - except for the most 
extreme cases of very poor fits - of the particular choice for 
the spatial distribution of galaxies within the haloes. This 
is verified even though different profiles are associated to 
different values for xLn an d is due to the fact that, while 
quantities such as {N ga \) and (./V 2 al ) (and therefore the pa- 
rameters associated to them) mainly determine the ampli- 
tudes of the £g h and £ 2/l terms, the distribution of galaxies 
within the haloes is directly responsible for the slope of £ 3 , 
especially on scales r < 1 Mpc. 

This effect is better seen in Figure (3) which presents the 
best-fit models obtained for different choices of the distri- 
bution run. More in detail, the solid line is for a NFW pro- 

* It is in some sense arbitrary to decide how many principal com- 
ponents of the correlated errors correspond to a real signal and 
how many correspond to noise. This can be done, for instance, by 
considering a fixed fraction of the variance (see, e.g., Porciani & 
Giavalisco 2002). 
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file, the dotted line for a power-law profile with (3 = 2, the 
dashed line for a power-law profile with j3 = 2.5 and the 
dashed-dotted line represents the case of a power-law profile 
with f3 — 3. Even though all the curves are almost indistin- 
guishable from each other on all scales r > 2 Mpc due to 
the very similar best-fit values obtained for the parameters 
which describe {N SB .\} and (-/Vgai (iVg a i — 1)), this does not hold 
any longer at smaller distances. In fact, when we push the 
analysis inside the haloes, the profile assumes a crucial im- 
portance; for instance, in the case of early-type galaxies it is 
clear that, while profiles such as NFW and singular isother- 
mal sphere can provide a good fit to the observations (with a 
slight preference for the first model) , anything steeper than 
these two is drastically ruled out by the data since it does 
not exhibit enough power on scales 0.3 < r/[Mpc] < 2. 

Table 1 states the same conclusion from a more quan- 
titative point of view showing that, while the NFW profile 
provides the best description of the data and the value of 
Xmin f° r a P — 2 model is still within the la range of the 
acceptable fits (for Gaussian errors), profiles of the form 
power-law with (3 > 2.5 are too steep to be accepted as 
satisfactory descriptions of the observed galaxy correlation 
function. We remark once again that, since the spatial distri- 
bution of galaxies within haloes of specified mass and their 
mean number and variance affect different regions of the ob- 
served correlation function, these two effects are untangled 
so that there is no degeneracy in the determination of the 
(•Wgai) an d p(r) preferred by the data. 

Since different profiles lead to notably different predic- 
tions for the correlation function on increasingly smaller 
(r < 0.2 Mpc, see Figure 3) scales, one would in princi- 
ple like to be able to explore this region in order to put 
stronger constraints on the distribution of galaxies within 
their haloes. Unfortunately, this cannot be done because - 
as Hawkins et al. (2003) have shown - fibre collisions in the 
2dF survey can significantly decrease the measured £ s (r) on 
scales r < 0.1 Mpc, leading to systematic underestimates. 
Nevertheless, one can still use the 2dF measurements of the 
correlation function as a lower limit to the real galaxy-galaxy 
clustering strength, to be compared with predictions steam- 
ing from different density runs. This is done in Figure 3, 
where the arrow represents the 2dF measurement of the 
early-type correlation function on a scale r ~ 0.07 Mpc. In 
this particular case, all the models appear to have enough 
power on such small scales as none of them falls below the 
accepted range of variability of the measured £ 9 (r). There- 
fore, on the basis of the present data, we cannot break the 
degeneracy between NFW and PL2 profiles as to which one 
can provide the best description of the data in the whole r 
range. 

If we then concentrate our attention on the two distri- 
bution profiles that can correctly reproduce the observations 
(NFW and power-law with /3 = 2, hereafter PL2) and ana- 
lyze the best-fit values obtained for the parameters describ- 
ing the halo occupation number and their associated errors, 
we find for instance that, while the slope 02 - which deter- 
mines the increment of the number of sources hosted by dark 
matter haloes of increasing mass in the high-mass regime - 
is very well determined, the situation is more uncertain for 
what concerns ai, counterpart of a-2 in the low-mass regime. 
On the other hand, m cu t and mo exhibit errors of similar 
magnitudes, with upper limits better determined than the 
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Figure 4. Projected correlation function of late-type galaxies. 
Data-points represent the results from Madgwick et al. (2003), 
while solid, dashed, dotted and dashed-dotted curves illustrate 
the best-fit models respectively obtained for a NFW, a power-law 
with j3 = 2.5, a power-law with /3 = 2 and a power-law with 
fi = 3 galaxy distribution profiles and for a truncation radius 
i"cut = f v ir- The curve flattening around r < 0.5 Mpc illustrates 
the contribution ^ h from galaxies residing in different haloes (see 
text for details) . The arrow shows the lower limit to the measured 
correlation function set on small scales by fibre collisions in the 
2dF survey (Ed Hawkins, private communication). 



lower ones, this last effect possibly due to the constraints on 
n g which discard every model not able to produce enough 
galaxies as it is the case for high values of m cu t and mo. 
An analysis of the x 2 hypersurface also shows that all the pa- 
rameters but ot2 (and especially m cut and mo) are covariant. 
This means that, in order to be consistent with the avail- 
able data, decreasing m cut with respect to its best-fitting 
value implies lowering mo and increasing on. The interplay 
is probably due to the fact that both ot\ and m cut only 
play a role in the low-mass regime which mainly affects the 
intermediate-to-large-scale regions of the galaxy correlation 
function, where measurements are more dominated by un- 
certainties. Conversely, £ 9 on small scales is strongly depen- 
dent on the adopted value of Q2, therefore making this last 
quantity measurable with an extremely high degree of pre- 



4.2 Late- type galaxies 

The case for late-type galaxies appears more tricky to treat 
in the framework of our analysis due to the shallow slope of 
the observed correlation function (see Section 3.2). In fact, 
as it can be appreciated in Figure (4), no model can correctly 
describe the slow rise of the data on scales r < 2 Mpc, even 
though the large-scale normalization of all the curves repro- 
duces with a good approximation the measured one. This is 
shown in a more quantitative way in Table 1, which quotes 
the minimum \ 2 values obtained for different distribution 
profiles and a truncation radius r cut = rvi r : no model gives 
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Table 1. Best-fit values for the parameters describing the halo occupation number (10), expressed for different choices of the distribution 
profile and truncation radius. PL2, PL2.5 and PL3 respectively correspond to profiles described by power-laws with slopes j3 = 2, j3 = 2.5 
and f3 = 3. Errors are for a 68.3% confidence level and three degrees of freedom. n g (in Mpc -3 ) and b are the average number density of 
sources and the bias on large scales as derived from the combination of the best-fit parameters associated to each model. All the masses 
are measured in mg units. Vanishing errorbars mean that either the best-fitting value lies on the boundary of the sampled parameter 
space (e.g. when ce± = —1) or that the uncertainty is smaller than our grid step (0.1 for all parameters). 
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an acceptable fit and the corresponding figures for Xmin are 
- in the best case - around 39. 

A closer look at Figure (4) indicates that the problem has 
to be connected with the excess of power on intermediate 
scales exhibited by the £ 2h contribution of pairs of galaxies 
from different haloes (solid curve which flattens on scales 
~ 0.5 Mpc), which creates the mismatch between the ob- 
served slope and the steeper "best-fit models", fn other 
words, what the plot reveals is that in the model there are 
too many pairs of galaxies coming from different haloes at 
distances 0.5 < r/[Mpc] < 2 with respect to what the data 
seem to indicate. 

A possible way out is therefore to deprive the 
intermediate-scale region of pairs of objects residing in dif- 
ferent haloes. This can be done if one assumes that the dis- 
tribution of late-type galaxies associated to a given halo 
does not vanish at the virial radius, but extends to some 
larger distance. The mutual spatial exclusion of haloes then 
ensures that there will be no pairs of galaxies belonging 
to different haloes in the desired range. Indeed, it is not 
surprising to find SO and disc galaxies futher away than 
a virial radius from a cluster centre (see e.g. figure 8 in 
Dominguez, Muriel & Lambas, 2001). We can attempt to 
let the galaxy distribution extend to - say - two virial radii. 
As Figure (5) shows, this assumption greatly reduces the 
discrepancy between models and measurements: the theo- 
retical curves now have the right slope and amplitude on 
scales 0.5 < r/[Mpc] < 2 and the corresponding best fits to 
the data exhibit Xmin values which are (almost) as good as in 
the case of early-type galaxies (see Table 1). Note that, even 
though we adopted a somehow "ad hoc" procedure to find 
a better description of the measurements, our finding seems 
to point out to the well-known phenomenon of morphologi- 
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Figure 5. Projected correlation function of late- type galaxies. 
Data-points represent the results from Madgwick et al. (2003), 
while solid, dashed, dotted and dashed-dotted curves illustrate 
the best-fit models respectively obtained for a NFW, a power-law 
with /3 = 2.5, a power-law with (3 = 2 and a power-law with 
P = 3 galaxy distribution profiles and for a truncation radius 
r C ut = 2 ■ r v i r . The arrow shows the lower limit to the measured 
correlation function set on small scales by fibre collisions in the 
2dF survey (Ed Hawkins private communication). 



10 M.Magliocchetti, C.Porciani 



cal segregation (see e.g. Madgwick et al., 2003; Dominguez, 
Muriel & Lambas, 2001; Giuricin et al., 2001 and Adami et 
al., 1998 for some recent results), whereby late- type galaxies 
tend to be found in the outer regions of groups and clusters, 
while early-type ones preferentially sink into the group or 
cluster centre. A simple calculation in fact shows that, for 
scales 0.5 < r/[Mpc] < 2, haloes which in our model are 
required to host star-forming galaxies up to two virial radii 
have masses in the range 10 11 ' 9 < m/rriQ < 10 13 ' 7 . As we 
will see better in the next Section, haloes within this mass 
range are expected to host on average 3 < (Agai) ^ 50 late- 
type galaxies, limits which span from a small group to a 
cluster of galaxies. 

Our result therefore does not contradict the well established 
fact that galaxies form within the virialized regions of dark 
matter haloes (i.e. at a distance < r v i r from their centre). 
What it simply states is that - possibly due to merging pro- 
cesses and accreting flows - late-type galaxies in groups and 
clusters are found within their dark matter haloes up to 
distances from the centre corresponding to about two virial 
radii. Note that we are not claiming that these galaxies are 
sling-shot towards their final position during the merging 
event. The key idea is that, at a certain point, in the pro- 
cess of approaching a merging event, galaxies residing in the 
progenitors of a given halo will be associated with the final 
object itself as their host halos loose their identity by the 
formation of high density "bridges" which alter the output 
of cluster-finding algorithms like the friends-of-friends one 
(on which both our mass function and bias parameters are 
based). This phenomenon might be less important for the 
population of early-type galaxies which, probably, form in 
galaxy mergers that tend to be more concentrated within 
their host haloes. 

We now discuss the results on the halo occupation num- 
ber and distribution profile obtained for the population of 
late-type galaxies under the assumption of a truncation ra- 
dius corresponding to two virial radii. As Table 1 shows, the 
best fit to the data in this case is provided by a model with 
Qi ~ —1, c*2 = 0.7, m cut = lO 111 m0, mo = 10 11A rriQ and 
for galaxies distributed within their dark matter haloes ac- 
cording to a PL2 profile, even though on the basis of this 
analysis we cannot really discard any p(r) model but PL3 
since they all show Xmin values within la from the favourite 
one (as for early-type galaxies, a model is accepted as a fair 
description of the data if the corresponding Xmin ue within 
A\ 2 = 3.53 from the best fit). 

In order to shed some more light on the distribution 
profiles able to correctly describe the correlation function 
of star-forming galaxies, we have then adopted the same ap- 
proach as in the previous sub-section and used the lowest-rT 
(~ 0.07 Mpc) measurement of £ 9 (r) derived by the 2dF team 
for late-type objects (Ed Hawkins, private communication) 
as a lower limit to the true galaxy clustering strength, to be 
compared with the available models. This is done in Figure 
5, where the lowest-scale data is represented by the vertical 
arrow. At variance with the case of early-type galaxies, the 
singular isothermal sphere model - even though providing 
the best fit to the r > 0.1 Mpc data - is now ruled out by 
the behaviour of the observed correlation function which in 
this regime needs steeper profiles. On the basis of the above 
comparison, one then has that only the PL2.5 a NFW den- 
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Figure 6. Projected correlation function of late-type galaxies. 
Data-points represent the results from Madgwick et al. (2003), 
while the solid curve is the best fit to the measurements obtained 
for a halo number density of the form (10), with ct± = —0.4, 
02 = 0.7, m C ut = lO 11 rri0, mo = 10 11 ' 4 rriQ and for galaxies 
distributed within their dark matter haloes according to a NFW 
profile with r cu t = 2 ■ r v ; r . Dashed and dotted lines respectively 
indicate the contribution from galaxies residing in different 
haloes and the term originating from galaxies within the same 
halo. 



sity runs can still be accepted as reasonable descriptions of 
the data, with a slightly stronger preference given to the 
last model since it also provides the (second) best-fit to the 
observations on scales r > 0.1 Mpc. 

The projected galaxy correlation function originating from 
the combination of values given in Table 1 and the NFW 
profile is illustrated by the solid curve in Figure (6). The 
dashed and dotted lines respectively indicate the contribu- 
tion £g h from galaxies hosted in different haloes and the 
term given by galaxies residing within the same halo. Note 
that, as in Figure (2), our best-fitting models tend to un- 
derestimate the observed correlation at intermediate scales 
(between 5 and 15 Mpc) by ~ 20 per cent. 

The uncertainties associated to the parameters describ- 
ing the low-mass regime of the halo occupation number seem 
to be bigger than what found for passive galaxies. This re- 
sult is however misleading, since the larger errors quoted for 
Qi and associated to the la lower limit of m cut only reflect 
the fact that the low-mass regime in equation (10) is prac- 
tically non-existent for star- forming galaxies as mo — m cut . 
In this case one than has that the halo occupation number 
behaves as the pure power-law: (A r ga i)(m) = (m/mo)° ! at 
almost all mass scales, where both Q2 and mo are very well 
constrained. 

As it was in the case of early-type galaxies, an analysis of 
the \ 2 hypersurface helps understanding the interplay be- 
tween the different parameters describing the halo occupa- 
tion number. We find that, while «2 and mo show little vari- 
ability, ai and m cut are strongly covariant, whereby higher 
values for the former quantity correspond to lower mini- 
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mum masses associated to haloes able to host a star-forming 
galaxy. 

As a final remark, we note that - also for late-type 
galaxies - values for ai, a 2 , m cut and mo which provide the 
best fit to the data do not depend on the particular form 
adopted for the spatial distribution of galaxies within the 
haloes. Once again, this finding stresses the absence of de- 
generacy in the determination of quantities such as p(r) and 
the average number of galaxies in haloes of specified mass, 
(N ga i)(m): as long as one can rely on clustering measure- 
ments which probe the inner parts of dark matter haloes, 
both functional forms can be obtained with no degree of 
confusion from the same dataset. 

4.3 Scatter of the Halo Occupation Distribution 
and Number Density Profiles 

All our results for the galaxy density profiles are derived by 
assuming a specific functional form for (N ga ,i(N ga ,\ — l))(m), 
as given in equation (11). In principle, this could bias our 
determination of the spatial distribution of galaxies within 
a single halo (Berlind & Weinberg 2002). In order to under- 
stand the importance of this effect, we repeated the analysis 
of the correlation functions by assuming 3 different func- 
tional forms for the second moment of the halo distribution 
function. In particular, we took a = 1,0.3,0.1 in equation 
(11) independently of the halo mass. We found that none 
of these models can fit the data as well as our original pre- 
scription. However, for purely Poissonian scatter, the best- 
fitting models for late-type galaxies are almost identical to 
our fiducial models, and the ranking of the density profiles 
does not change. On the other hand, for early-type galax- 
ies, Poissonian models are associated with large values of \ 2 
(23.7 at best, for PL2) due to the overabundance of power 
on small scales (the favorite values for m cut and mo tipically 
are ~ 10 times smaller than in our fiducial case) but, still, 
only the PL2 and NFW profiles are acceptable. When the 
scatter, instead, is strongly sub-Poissonian (a = 0.1,0.3) 
we find that it is practically impossible to get a good de- 
scription of £ 3 . In fact, in this case, the 1-halo term in the 
correlation function is heavily depressed and one is forced 
to increase the value of a 2 and m cut to try to match the 
data. Anyway, all the models are unacceptable due to lack 
of power on small scales, and all the profiles are associated 
with nearly the same values of \ 2 ■ In summary, even though 
we confirm the presence of some degeneracy between the 
second moment of the halo occupation distribution and the 
profile of the galaxy distribution as discussed in Berlind & 
Weinberg (2002), we found that it is extremely hard to find 
models that can give accurate description of the data. This 
means that the apparent freedom in assuming a functional 
form for {N gal (N gal — l))(m) is not such. As a consequence 
of this, we are led to believe that our conclusions regarding 
the density profiles of 2dF galaxies are indicative of a real 
trend, even though they are indeed drawn in the framework 
of a specific model. 

5 THE GALAXY MASS FUNCTION 

The results derived in the previous sections allow us to draw 
some conclusions on the intrinsic nature of galaxies of dif- 



ferent types. Figure (7) shows the average number of galax- 
ies per dark matter halo of specified mass m as obtained 
from the best fits to the clustering measurements of Madg- 
wick et al. (2003). The dashed line describes the case for 
star-forming galaxies, while the solid line is for early-type 
objects. 

Late-type galaxies are found in haloes with masses 
greater than ~ 10 11 m© (even though this figure, espe- 
cially in its lower limit, is not determined with a great accu- 
racy given the interplay between m cut and on), and their 
number increases with the mass of the halo which hosts 
them according to a power-law (except in the limited re- 
gion 10 110 < m/m < lO 114 ) with slope a 2 = 0.7 and 
normalization 1/mo = 10 _11 ' 4 mg 1 . 

Early-type galaxies instead start appearing within haloes of 
noticeably higher masses, m > 10 12 ' 6 m . In the low-mass 
region (i.e. for m < 10 13 ' 5 m Q ), the data seems to indicate 
that each halo is on average populated by approximately 
one passive galaxy, even though results in this mass range 
are affected by some uncertainties. More solid are the find- 
ings in the high mass (m > 10 13 ' 5 m Q ) regime, which show 
{N ga .i} to increase with halo mass as a power- law of slope 
a.2 — 1.1, steeper than what found for the population of 
late-type galaxies. One then has that the average number of 
star-forming galaxies within a halo does not increase with its 
mass as fast as it happens for passive galaxies, even though 
late-type objects are found to dominate the 2dF counts at 
all mass scales. This result seems in disagreement with the 
observational evidence that early-type galaxies are preferen- 
tially found in clusters, while star-forming galaxies mainly 
reside in relatively underdense regions. The discrepancy is 
however only apparent since, while in the Madgwick et al. 
(2003) analysis the population of early-type galaxies is an 
homogeneous sample, the class of late-type galaxies includes 
any object which has some hint of star-formation activity in 
its spectrum (see Section 3). This implies that anything - 
from highly irregular and star-bursting galaxies to SO types 
- will be part of the star-forming sample. The lack of a 
characteristic mass for the transition from haloes mainly 
populated by late-type galaxies to haloes principally inhab- 
ited by passive objects then finds a natural explanation if 
one considers that SO galaxies - also preferentially found in 
groups and clusters - are in this case associated to the pop- 
ulation of late-type objects, making up for about 35 per cent 
of the sample (Madgwick et al., 2002). We also remind that 
both the 2dF and its parent APM surveys select sources in 
the blue (bj) band, therefore creating a bias in favour of 
star-forming galaxies which are more visible than passively 
evolving ellipticals in this wavelength range. 

It is interesting to compare our results with those by 
van den Bosch et al. (2003) who used 2dF data to esti- 
mate the conditional luminosity functions of early- and late- 
type galaxies. They combined their results to obtain the 
mean halo occupation number of galaxies in given absolute- 
magnitude ranges. Since our analysis is based on a apparent- 
magnitude limited sample, this complicates the compari- 
son. For their faintest late-type galaxies, the behaviour of 
(Agai)(m) is relatively similar to our findings with a cut- 
off below ~ 10 n mo, a small decrement up to lO n ' 5 m,0 
and a power-law regime dlog(A r ga i)(m)/dlogm ~ 0.6 for 
larger masses. As for early-type galaxies, results are in good 
agreement, with a power-law high-mass regime with slope 
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Figure 7. Average number of galaxies (7V ga i (m)) per dark matter 
halo of specified mass m (expressed in mg units). The solid line 
represents the case for early-type galaxies, while the dashed line 
is for late-type objects. 
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and volume (in Mpc -3 units). The solid line represents the re- 
sult for early- type galaxies, while the dashed line is for late- type 
objects. For comparison, the dotted line indicates the Sheth &: 
Tormen (1999) mass function n(m) of dark matter haloes. 



dlog(AT ga i)(m)/dlog m ~ 1. However, van den Bosch et 
al. (2003) find a less sharp cutoff at small masses, with 
log(mo/m0) ~ 12.5 and (JV ga i(m)) gently declining with 
decreasing m till to lO lo ' 5_11 m0. Given all the systematic 
uncertainties belonging to the two method of analysis the 
similarity of the results is remarkable, confirming the po- 
tential of the halo occupation distribution method. 

Finally, Figure (8) shows the "galaxy mass function" i.e. 



the number density of galaxies per unit of dark matter mass 
and volume as obtained by multiplying the average number 
of galaxies found in a halo of specified mass {N m \ (m)) by the 
halo mass function (8). Again, the solid curve identifies the 
case for early-type galaxies, while the dashed one is derived 
for the population of star-forming galaxies. The dotted line 
indicates the Sheth & Tormen (1999) halo mass function (8). 



6 CONCLUSIONS 

Results from Madgwick et al. (2002) and (2003) on the cor- 
relation function and luminosity function of ~ 96, 000 2dF- 
GRS galaxies with 0.01 < z < 0.15 have been used to inves- 
tigate some of the properties of early- and late-type galaxies, 
such as the so-called halo occupation number (AT ga i) (i.e. the 
mean number of sources that populate a halo of given mass 
m) and the spatial distribution of such galaxies within their 
dark matter haloes. 

In order to perform our analysis, we have considered 
four distribution profiles: three power-laws of the form 
p(r) oc r _/3 with (3 = 2, 2.5, 3 and a NFW profile chosen 
to mimic the assumption for galaxies within haloes to trace 
the distribution of dark matter. As a first approximation, all 
the profiles have been truncated at the halo virial radius. 
For consistency with results from semi-analytical models 
(see e.g. Benson et al., 2001), the halo occupation num- 
ber was parametrized by a broken power-law of the form 
(m/mo)" 1 in the low mass regime m cut < m < mo and 
(m/mo)" 2 at higher masses, where m cut is the minimum 
mass of a halo that can host a galaxy. 

The resulting theoretical average number density n g 
and galaxy-galaxy correlation function - sum of the two 
terms £g h , representing the contribution from galaxies re- 
siding within the same halo, and which considers pairs 
belonging to different haloes - have then been compared 
with the observations in order to determine those models 
which provide the best description of the data both in the 
case of late- type and early- type galaxies. Note that, at vari- 
ance with previous works which only considered a linear t^f 1 , 
our analysis provides a full treatment for non-linearity and 
also includes the assumption of halo-halo spatial exclusion, 
in a way that makes the model entirely self-consistent. 

The main conclusions are as follows: 

(i) Early-type galaxies are well described by a halo oc- 
cupation number of the form broken power-law (10) with 
Qi ~ —0.2, a2 — 1.1, m cu t — lO 12,6 m0 and mo — lO 13 ' 5 m0, 
where the two quantities which determine the intermediate- 
to- high mass behaviour of (iV ga i) are measured with a good 
accuracy. 

(ii) No model can provide a reasonable fit to the cor- 
relation function of late-type galaxies since they all show 
an excess of power with respect to the data on scales 
0.5 < r/[Mpc] < 2. In order to obtain an acceptable descrip- 
tion of the observations, one has to assume that star-forming 
galaxies are distributed within haloes of masses comparable 
to those of groups and clusters up to two virial radii. This 
result is consistent with the phenomenon of morphological 
segregation whereby late-type galaxies are mostly found in 
the outer regions of groups or clusters (extending well be- 
yond their virial radii), while passive objects preferentially 
sink into their centres. 
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(iii) With the above result in mind, one finds that late- 
type galaxies can be described by a halo occupation number 
of the form single power-law with ai ~ 0.7, m cu t ^ 10 n mo 
and mo ~ lO 11,4 m0, where the quantities which describe 
{Nggi) in the high- mass regime are determined with a high 
degree of accuracy. 

(iv) Within the framework of our models, galaxies of any 
kind seem to follow the underlying distribution of dark mat- 
ter within haloes as they present the same degree of spatial 
concentration. In fact the data indicates both early-type 
and late-type galaxies to be distributed within their host 
haloes according to NFW profiles. We note however that, 
even though early-type galaxies can also be described by 
means of a shallower distribution of the form p(r) oc r~ 2 , 
this cannot be accepted as a fair modelling of the data in 
the case of late-type galaxies which instead allow for some- 
how steeper (f3 ~ 2.5) profiles. In no case a /3 = 3 density 
run can provide an acceptable description of the observed 
correlation function. These conclusions depend somehow on 
assuming a specific functional form for the second moment 
of the halo occupation distribution. We showed, however, 
that there is not much freedom in the choice of this function 
is one wants to accurately match the observational data. 

An interesting point to note is that results on the spatial 
distribution of galaxies within haloes and on their halo occu- 
pation number are independent from each other. There is no 
degeneracy in the determination of (iV ga i) and p(r) as they 
dominate the behaviour of the two-point correlation func- 
tion £ s at different scales. Different distribution profiles in 
fact principally determine the slope of £ 9 on small enough 
(r < 1 Mpc) scales which probe the inner regions of the 
haloes, while the halo occupation number is mainly respon- 
sible for the overall normalization of £ 9 and for its slope on 
large-to-intermediate scales. 

Our analysis shows that late- type galaxies can be hosted 
in haloes with masses smaller than it is the case for early- 
type objects. This is probably due to the fact that early-type 
galaxies are on average more massive (where the term here 
refers to stellar mass) than star-forming objects, especially 
if one considers the population of irregulars, and points to 
a relationship between stellar mass of galaxies and mass of 
the dark matter haloes which host them. 

The population of star-forming galaxies is found to be 
the dominant one at all mass scales, result that can be rec- 
onciliated with the well established observational fact that 
early-type galaxies are preferentially found in clusters, while 
star-forming galaxies mainly reside in relatively underdense 
regions, by considering that about a third of the class of 
late-type sources in our sample is made of SO galaxies, which 
are also preferentially found in groups and clusters. We also 
stress that both the 2dF and its parent APM surveys se- 
lect sources in the blue band, therefore creating a bias in 
favour of star-forming galaxies which are more visible than 
passively evolving ellipticals in this wavelength range. 

As a final remark, we note that the results of this work 
are partially biased by the need to assume a pre-defined 
functional form for both the halo occupation number and the 
variance about this quantity. High precision measurements 
of higher moments of the galaxy distribution function (such 
e.g. the skewness and the kurtosis) are of crucial importance 
if one wants to determine the distribution of galaxies within 



dark matter haloes in a non-parametric way, i.e. without 
the necessity to rely on any "a priori" assumption. In the 
near future, results from the 2dF and SSDS galaxy redshift 
surveys should be able to fill this gap. 
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